Experiments in Reusability of Grammatical Resources

Authors

  • Doug Arnold
  • Toni Badia
  • Josef van Genabith
  • Stella Markantonatou
  • Stefan Momma
  • Louisa Sadler
  • Paul Schmidt
Abstract

1 Introduction

Substantial formal grammatical and lexical resources exist in various NLP systems and in the form of textbook specifications. In the present paper we report on experimental results obtained in manual, semi-automatic and automatic migration of entire computational or textbook descriptions (as opposed to a more informal reuse of ideas or the design of a single "poly-theoretic" representation) from a variety of formalisms into the ALEP formalism.[1] The choice of ALEP (a comparatively lean, typed feature structure formalism based on rewrite rules) was motivated by the assumption that the study would be most interesting if the target formalism is relatively mainstream without overt ideological commitments to particular grammatical theories. As regards the source formalisms, we have attempted migrations of descriptions in HPSG (which uses fully-typed feature structures and has a strong 'non-derivational' flavour), ETS (an un-typed stratificational formalism which essentially uses rewrite rules for feature structures and has run-time non-monotonic devices) and LFG (which is an un-typed constraint and CF-PSG based formalism with extensions such as existential, negative and global well-formedness constraints).

[1] … the CEC as part of the project ET10/52.

Reusability of grammatical resources is an important idea. Practically, it has obvious economic benefits in allowing grammars to be developed cheaply; for theoreticians it is important in allowing new formalisms to be tested out, quickly and in depth, by providing large-scale grammars. It is timely since substantial computational grammatical resources exist in various NLP systems, and large-scale descriptions must be quickly produced if applications are to succeed.
Meanwhile, in the CL community, there is a perceptible paradigm shift towards typed feature structure and constraint-based systems and, if successful, migration allows such systems to be equipped with large bodies of descriptions drawn from existing resources. In principle, there are two approaches to achieving the reuse of grammatical and lexical resources. The first involves storing or developing resources in some theory-neutral representation language, and is probably impossible in the current state of knowledge. In this paper, we focus on reusability through migration: the transfer of linguistic resources (grammatical and lexical descriptions) from one computational formalism into another (a target computational formalism). Migration can be completely manual (as when a linguist attempts to encode the analyses of a particular linguistic theory in some computationally interpreted formalism), semi-automatic or automatic. The starting resource can be a paper description or an implemented, runnable grammar. The literature on migration is thin, …

Similar resources

Modularity of grammatical constraints in HPSG-based grammar implementations

This paper is a contribution to the discussion of the choices involved in implementing HPSG-based grammars and their consequences on the modularity and reusability of grammatical resources, a central issue in multi-lingual grammar development. Based on two examples from the English Resource Grammar (Flickinger et al., 2000), the treatment of unbounded dependencies and the analysis of optional a...


An Architecture Sketch of EUROTRA-II

This paper outlines a new architecture for an NLP/MT development environment for the EUROTRA project, which will be fully operational in the 1993-94 time frame. The proposed architecture provides a powerful and flexible platform for extensions and enhancements to the existing EUROTRA translation philosophy and the linguistic work done so far, thus allowing the reusability of existing grammatical...


Evaluating learning resources for reusability: the "dner & learning objects" study

The DNER&LO study gathered data about 27 elearning projects, mapping the categories of content being produced, and approaches to reusability and interoperability. Eighteen were chosen for closer study and evaluation, based on availability of content, and on covering a wide range of content categories. Appropriate reusability evaluation criteria were developed specifically for the study, in four...


Phase I Testbed Description: Requirements and Selection Guidelines

The Application of Reusable Software Components Project has constructed a reuse testbed for conducting software engineering experiments in software reusability. The hardware and system software of the testbed will provide a distributed computing environment with file-server capability for the storage of reusable components and other artifacts of the development process. The testbed will support...


Linear Logic, Proof Nets and Categorial

categorial grammars are not intended as yet another grammatical formalism that would compete with other established formalisms. It should rather be seen as the kernel of a grammatical framework in which other existing grammatical models may be encoded. 4.1.2. Interaction Grammars Interaction Grammars (IGs) are a linguistic formalism that aims at modelling both the syntax and the semantics of na...



Publication date: 1993